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In  course  of  our  researches  on  partial  vapor  pressures  and  the 
theory  of  dl3tillation,-f-my  students  and  myself  had  frequent  occasion 
to  use  the  liethod  of  Least  Squares,   To  facilitate  the  extensive  com- 
putations involved,  I  devised  a  simplification  and  calculated  a  nvimber 
of  a-uxiliary  formulae,  which  ma^^  save  much  superfluous  labor  to  others 
and  are  therefore  reproduced  in  the  following  pages. 

In  the  mathematical  treatment  of  scientific  results  labor  is  often 
wasted  on  a  degree  of  precision  in  e^zcess  of  the  accuracy  of  the  re- 
sults themselves.   For  instance,  two  experimental  figures,  6,7893 
and  3,4578,  involving  an  error  of  at  least  1  part  in  35,000,  might  be 
multiplied  with  arithmetical  rigor  to  obtain  the  product  23,47604154 
implying  an  error  of  1  part  in  t\70  billion.  An  observer  of  experience, 
aiming  merely  to  keep  his  mathematics  within  the  limits  of  his  experi- 
mental errors,  would  multiply  the  tv/o  figures  in  some  such  v/ay  as  this: 


6, 

;7893 

3, 

,4578 

go. 

,367^ 

2 

7157 

3395 

475 

54 

23.4760 
In  the  product,  written  23,476,  the  multiplication  error  of  4  in  two 
million 
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would  be  negligible  •ompared  with  the  experimental  error  of  the  multi- 
plier. 

In  the  vise  of  the  Least  Squares  this  type  of  simplification 
must  not  be  employed  without  alert  v/atchfulness,  matters  being  compli- 
cated by  the  additions  and  subtractions,  by  which  the  relative  errors 
are  liable  to  be  greatly  magnified.   The  semi-graphic  procedure  rec- 
ommended below,  vaguely  analogous  in  that  it  too  aims  merely  to  keep 
the  mathematical  errors  within  those  of  the  e^iperiments  to  be  repre- 
sented, will  be  found  acctirate  enough  for  all  ordinary  purposes  and 
safe,  Tne   procedure  is  based  on  the  substitution  of  carefully  inter- 
polated figures  for  the  a»tual  results  of  observation. 

The  given  experimental  results  are  plotted  on  accurately  ruled 
millimeter  paper,  the  scale  large  enovxgh  to  show  the  likely  errors 
of  observation  or  experiment,   A  smooth  curve  is  drawn  free-hand  tc 
represent  the  trend  of  the  points  as  closely  as  the  eye  v;ill  allow; 
or  else  a  neat  wavy  curve  is  dravm  through  the  points  themselves.   In 
some  cases,  if  the  points  are  more  or  less  evenly  thrown  by  the  errors 
to  the  one  and  the  other  side  of  any  curve  on  which  they  might  belong, 
neighboring  points  may,  for  the  purpose  of  interpolation,  be  con- 
nected by  straight  lines,  li^om  the  curve,  or  from  the  broken  line, 
we  read  the  ordinates  corresponding  to  a  set  of  uniforml^r  increasing 
abscissas  to  v/hich  are  assigned  the  values  x  a  0,  1,  2,  3,  4,  ,,,9, 
10;  or  some  similar  set.   Still  further  simplification  comes  of 
assigning  to  the  abiscissas  the  values  (:n:-5)  s  -5,  -4,  -3,  -2,  -1,  0, 
1,  2,  3,  4,  5. 

These  ordinates  yield  £;y,  and  simple  further  calculation  leads 
to  the  values  of  ^xy,  X^c  7*  etc.   The  equations  that  v/ould  ordinarily 
result  have  been  solved  by  me  in  advance  for  the  coefficients  a,  b,  c, 
••.  of  a  series  of  equations  of  the  form 

y  s  a  -«-  bx  -I-  cx^  4-  , , , , 
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Prom  the  solutions  given  belov/,  the  coefficients  a,  b,  6,  4**.  may  be 

obtained  imraediately  by  substltutlns  the  Values  of£y,  2Ixy,  etc*   The 

final  nximerlcal  coefficients  a,re  yielded  by  transforming  the  arbitrary 

X  S  1,  2,  3,  ,,,,  or  (  x-5  )  s  «5,  -4,  -3,  ..*,,  into  the  ^iven  values 

of  X, 

For  the  benefit  of  less  experiencei  computers  it  may  be  pointed 
out  that  it  is  a  little  easier  to  multiply  y  times  x,  than  yx  times 
X,  yx^  tiiiies  x,  ,,.,  than  y  times  x^,  y  times  x^,  etc, 

A  simple  example  will  illustrate  the  procedure  recomiaended  and 
the  closeness  of  its  results  to  those  of  the  direct  procedure  in 
general  use.  For  a  given  act  df  ten  observations,  recorded  in  Table 
I,,  let  y  s  a-fbx,  and  say  that  the  observed  values  of  7  correspond 
to  X  2  0.5,  1.5,  2,5,  ,,.,  8.5,  9,5, 

In  general,  the  liethod  of  Least  Squares,  applied  to  a  linear 
relationship,  yields  the  following: 

Formulae  for  Calculating^  the  Coefficients  of  y  «  a  +bx.  Based  on  n 
Observations : 


a  «^x  Xy  ^IT^Xxy     1 


(1) 

b  s  n  ^xy-  2'x  ^y 
nZxS-  (rx)2 

For  our  given  observations  these  formulae  lead  to  the 

equation  '  '      

7  =  3,0068181  -f-  l,9936364x (A) 

In  Table  I,  the  first  two  columns  record  the  observed  dataj  the  third 
gives  the  values  of  y  calculated  by  equation  (A);  the  fourth  gives  A,  , 
the  differences  between  the  calculated  and  the  observed  va.lues  of  7, 


>  « 


X^i^'XO    Qiiiil. 


*    r 


.  I' '. 


•"■■  ''ir 


>.   V  >■  «    ' 

^  t  •  %  ■    ^  ,:>- 


e  '»"- 


,.ald*- 


^,t««>'; 


BV    ft- 


*'j3rf:t'  r^'^''  ■  ?>?;i»' "  ? y cf  >  »^' '  - 


a-*-  ;^   :^ 


■•"•  •■  •     ■■.-.i 


J*-iL»«4k4l4<l;*.      B*ft4 


'       '      ' ■ W      « 


»  •     «      X     •  , 


■  ;  \      .   A  ■ 


-rl'  -r  1  -VT 


s    ^i  . 


.A. 


whloli  yield  the  mlnimxim:^-^'^  b  0,168.  An  additional  fifth  column  shows 
the  differences  percent,  - 


Table 

I. 

X 

yCobs, ) 

yCoalc,  ) 

A 

(calc-obs. ) 

6,fo 

0.5 

4.10 

4.0056363 

-0.0963637 

-2,35 

1.5 

6.15 

,  5,9972727 

-0.1527273 

-2,48 

2.5 

7.80 

7.9909091 

-V-0. 1909091 

-^.45 

3,5 

9.85 

9,9845455 

f 0,1345455 

41.37 

4,5 

12.10 

11,9781819 

-0.1218181 

-1,01 

5,5 

13.80 

13,9718183 

+0,1718183 

-^.1,25 

6,5 

15, 9t 

15,9654547 

-^0,0654547 

4-9.41 

7,5 

18.05 

17,9590911 

-0,0909089 

-0,50 

8,5 

20.10 

19,9527275 

-0,1472725 

-0,73 

9,5 

31.90 

21.9463639 

-h).  0463639 

f0,21 

V/e  now  employ  the  indirect  procedure.   The  observations  are  plotted 
©n  a  scale  where  50  mm.  represent  one  unit  of  x,  and  20mm,  one  •unit  of 
y,   Nelgliboring  points  are  connected  by  straight  lines.   The  arbitrary 
values  X  a  1,  2,  3,    ,,,,  8,  9,  and  the  corresponding  values  of  y  read 
from  the  broken  line  are  given  in  the  first  two  columns  of  Table  II, 
Prom  these  we  get^y  «  116,90  and  ^ xjf^   704,35,  which  yield  imi'iiediately 
the  coefficients  a,  b,  of  the  required  equation  by  substitution  in  the 
following  formulae: 


Formulae  for  Caloulating  the  Coefficients  of  y  a  a-|--bx.  Based  on  Nine 

Points; 

X  3  1,  2,  3,  ,..,  8,  9, 
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We.  thxis   obtain  the   equation: 
7  =     3.0013889  4-  l,9975000x 


(2) 


(B) 


7 

x(©bs. ) 

0,5 

5.15 

' 

1.5 

7.00 

2.5 

8.85 

3.5 

11.00 

4.5 

12.95 

5,5 

14.85 

* 

6.5 

17,00 

7.5 

19.10 

' 

8.5 

21.00 

9.5 

Tr.W.e   II 
y(obs,)  7(calc,  )      A-j_(calc. — obs,)^^^ 

4.0001389        -0.0998611  -2,41 


4.10 

6.15 

7.80 

9,85 

12.10 

13,80 

15.90 

18.05 

20.10 

21.90 


5,9976389 
7.9951389 


-0.1523611 
-1-0.1951389 


9.9926382  -)-0, 1426389 

11.9901389  -0,1098611 

13.9876389  -1-0.1876389 

15.9851389  -^  0,0851389 


17,9826389 
19.9801389 


-0.0673611 
-0,1198611 


21.9776389   f 0.0776389 


-2.48 
-f-2.50 
+1,45 

-0.91 
-1-1.36 
-^0.54 

-0.37 

-0.60 
-\-0.35 


The  third  and  fourth  columns  of  Table  II,  reproduce  again  for 
comparison  the  "observed"  values  of  x  and  yj  the  fifth  gives  the  values 
of  y  taltulated  by  equation  (B);  the  sixth  shows^  g,  the  differences 
betv/een  these  calculations  and  the  observations  and,  again,  the  last 
column  shows  the  differences  percent. 
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Plainly,  the  indirect  procedure  and  equation  (B)  reproduce  the 
results  all  but  as  well  as  the  usual  direct  procedure  and  equation  (A). 
The  sxim  of  the  squares  of  the  differences  between  the  calculated  and 
the  observed  values, Z^A  2  "  0,171,  is  very  close  to  the  minimum, 
V  A  .  S  0,168,  rf  the  direct  procedure. 

In  place  of  the  nine-point  formulae  (1),  the  following  based  fn 
eleven  points,  will  be  found  more  convenient  in  some  oases: 

Formulae  for  Calculating:  the  Cp^effioients  of  y  "  a  f-bx.  Based  on 
Eleven  Points ;   x  "  0,  1,  2,  3,  ,,,,  8,  9,  10, 


(3) 


^»5^y-^X3cy 


110 
Applying  these  formulae  to  our  test  case,  we  obtain  the 


equation: 


y  =  3, 00454545 -HI. 99636364X ,  (C)  5 

Table  III,  shows  the  results.   The  first  two  columns  reproduce  ; 

once  more  the  "observed"  values  of  x   and  y.   The  third  gives  the  values  i 
of  y  calculated  by  equation  (C),   The  fourth  and  fifth  show  the  dif f er- j 

ences  between  the  calculated  and  the  observed  values,     ..  ' 

X  2  I 

Here  the  sum  of  the  squares  of  the  differences,£A  ^  S  0,169,  is  ■ 

even  closer  to  the  minumum<EA   s  0,168,  yielded  directly  by  the  ob-  \ 

servations,  ; 

The  differences  between  the  values  of  y  from  our  indirectly  ; 

gotten  equations  (B)  and  (C)  and  those  from  equation  (A)  based  im-  ^ 

mediately  on  the  observations,  are  small  compared  with  the  errors  of-  ; 

the  observations  themselves,  | 
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Table 

III. 

X 

yCobs, ) 

yCcalc.  ) 

A.    (ealc,-»-obs, ) 
i 

A,^ 

©.5 

4.10 

4,00272727 

-0.09727273 

-2.37 

1.5 

6,15 

5,99909091 

-0.15090909 

-2.45 

2.5 

7,80 

7,99545455 

+0,19545455 

+2.51 

3,5 

9.85 

9.99181818 

-f  0,14181818 

+1,44 

4.5 

12,10 

11,98818182 

-0,11181818 

-0,92 

5.5 

13,80 

13,98454545 

f 0,18454545 

■+1.34 

6.5 

15.90 

15.98090909 

+0,08090909 

+0.51 

7.5 

18.05 

17.97727273 

-0, 07272727 

-0.40 

8.5 

20,10 

19.97363636 

-0,12636364 

-0.63 

9.5 

21,90 

21,97000000 

-+0,07000000 

+0.32 

Below  are  given  several  sets  of  formulae  for  calculating 
the  coefficients  of  parabolic  equations  ©f  the  second,  third,  and  fourth 
degrees,  v/hich  will  suffice  to  meet  most  ordinary  needs.   In  using 
these  formulae  the  number  of  significant  figures  in  the  products  in- 
volved should  only  be  reduced  with  great  caution  (if  at  all). 


Formulae  for  Calculating;  the  Coefficients  of  y  ■  a  f  bx  +cx  ,  Based  on 
Nine  Points;   x  s  0,  1,  2,  3,  .....  7,  8. 

a  5  +3052ry  -  1428Xxy  -+140^xgy 

~  4ggO  " 


b     S     -1428X"y  -|-1057£'xy  -  120^x^y 

4620 


(4) 


0     »    -l-140^y  -  120jlxy-/-15irx^y 

4620 


If  the  parabola  is  to  pass  through  the  origin,  then 
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a  5     0 

660 


(V) 


c  =     ,i9Zy-\-5rxy 
660 


Fe^rmulae  for   Calculating   the   Coefficients   of  y  =  a+bx+cx^.   Based  rn 

Eleven  Points;      x  a  0,    1,   2,   3,    8,    9,    10. 

a  a     -)-4980^y  -  1890Xxy -^ISoZx^y 

8580 

b  S     -1890lly -HlOVsITxy  -  lOoEx^y  > (8) 


c  a    -J-lSOlTy  ,  lOoXxy-t-loilx^y 

8580 


If  the  parabola  is  to  pass  through  the  origin,  then 


a  B  0 

b  =  +55 Zy  -   yZxy 
330 


(9) 


e  a 


-Vly  f>- 


330 


SL 


Pormulae  for  Calculating  the  Coefficients  of  y  S  A-f^B  (x-5)^  C(x-5)2 
+  D(x-5)  ,  Based  on  Eleven  Points ;    (x-5)  s  ~5,  -4,  »-3,  ,  ...♦3,4,5, 


A  a  -|-6408ry  -  560^y(x-5)^ 

30888 


B  =  -/-1865>-y(x^5)+-89S:y(x'»5)^ 

30888 

C  =  ■.36tEy  -f-36i:  y(x-5  )^ 

30888 

D  a  w89r:y(x^5)  -(-5^-y(x~5)^     . 

30888  J 


(10) 
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The  coefficients  rf  y  ■  a  +  bx -j-cx^ -f  dx*^  may  be  rbtained  directly  by 
the  following  set  of  foniiulae;  it  will  be  noted  that  these  still  in- 
volve   (x-5)    B  -5,    -4,    -3,    «..•    4,    5,    and  nrt  x  s   0,    1,    2,    3,    10, 

Fcnnulae   for  Calculating  the   Coefficients    of  y  s  a-Vfcx-tex^-^ix^  Baaed 
on  Eleven  Points;      xS     0,    1,   2,   3,    ,, 7,    9,    10 

a  =     -2592^7  -H800ry(x-5)-h540ry(x-5)^    -  ISfXyCx-S  )'^ 
b   =    -f-SeOoZy  ..  4810.Zry(x-5)    >■  56oEy(x-5  )^ -|-286^y(x-5  )^ 


c  = 


30888 

-SGoHy  4-1535 ^y( x-5) +56 1"y(x..5)^   -  'JdZjjx-b)^ 

30888 


...(11) 


-  "892ry(x-5  )  -h  5SIy(x-5 
30888 


As  a  rule,  it  will  be  simpler  to  use,  net  Formulae  ( 11 ) ,  but 
(10),  then  calculate  the  coefficients  a,  b,  c,  d,  of  y  s  a-Vfoxfcx^-t-dx"^ 
by:   a  =  A  -  5B+25C-1251;   b  =  B-10C  +  75D;   c  «   C  -  15D;  d  =  D. 
If  the  cubic  curve  must  pass  throixgh  the  origin,  then: 


a  a  0  ~^ 

b  s  -2510^y(x-5)-^590l:y(x-5)^-f-56iry(x-5)^ 


303S8 


0  B  -fl085^y(x-5)  -  59iry(x-5)^  -  50^y(x-5)^ 


V 


50838 


d  =  -89<Ly(x-5)  -f5iry(x>-5)^ 
3068S 


(12) 


Or  else,  still  for  the  eleven  points,  x  =  0,  1,  2,  3,  .,,.,,,,  9,  10, 

the  following 


Vf  X^^' 
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a  =   0  A 

b  =  4-56156^7  -  11550 Tyx -^  SSO^yx^ 


51480 


c  2  -11550Jry  f4125  ^  yx  » 

51480 


515^yx^ 


d  =  -f  830Ly  -  315Cyx-|-25^yx^ 


IS. 

51480 


' 


J 


(13) 


Formulae  for  Calculating  the  Coefficients  rf  y  s  AfB(x-5)+C(x-5)2-K5 
(x-5)^4e(x-5)'^.  Baaed  en  Nine  Points;  (x-5)  a  4,  -3,  -2,  -1,  0,  1,  2, 
3,  4. 

A  2  4-154656£ry  -  59960£:y(x~5  )^^1944Z'y(x-5)^ 

375656 


^  "  i-4238f^y(x-5)  -^   3068lly(x~5)^ 

^370656 


C  =  -.39960E:y-M8207^y(x-.5)^  -  1035ry(x-5)^ 

37«656 


D  2  -5068lly(x~5)  -)- 260  ^y(x~5  )^ 
-  370656 


(14) 


E  =  -^1944l.y  -  1035Zy(x~5)^4-65>  y(x-5)^ 

Fcrmulae  for  Calculating  the  Coefficients  of  y  ■  a4-B(x'«5)-V  0(x-5)^-j- 
D(x-5)^4E(x-5)^,  Basod  on  Eleven  Points;   (x-5)  S  -5,  -4,  -3,  -2, 
-1,  0,  1,  2,  3,  4,  5, 
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A  s     441184  ^y^  684f  ry(x-5)  ^-\- 216^7  (x~5)'^ 

123552 

B  ::    -f7460>ry(x-5)    -  556^y(x»-5)^ 

123552 

C   ::     ~684tiryf^019Z"y(x^5)^   ^^^H-jUr-b)^  \ ,.(15) 

123552 

D   =      -556^y(x-.5  )-h20  ry(x>-5  )^ 

f2B%r ;■ — ~ 

E  =    -1-216  ZTy^  75£l7(x-.5)g-4-5l"y(x-5)^ 

rS3B52  ~ 

Formulae  for  Calculating  the   Coefficients   of  y  »     afbxfcx^-l-dx'^fex^, 
Based  on  Eleven  Points ;      x  S      0,    1,   2,   3,    •..••«.    8,   9,   10, 

a  s    -h5184ry-H7gOOXy(x~5)   .■  524o£:  y(x-5)^  -  72tZy(x.-5)^-f 216ly(x-5)^ 

—  123552  '  "^  i 

t   =     ~5960Cg:y  "  19240j:y(x-.5)i-  17510iry(x..5)^-H144r"y(X'-5)^,.75rry(x^5  )^/ 
■  123552  '  ' 

c  s    4-25560ry-/5340i:y(x-5)   ^  925]Xy(x-5)^  -SOOryCx-S  )'^f  575Ey(x..5  )^     U 

123552 

d  S      ^4520^y  ~  556Zy(x>.5  )-V-1500^y(x-5  )S-h20iry(x~5  )^   ^  6oEy(x«5)^ 

^ggggg  -  — 

P   a    -f-216ry  >■  75£y(x~5)g-h5ry(x>-5)^ 

-L23552 

If   this  curve  must  pass   through   the   origin,    then: 
at:        0 

b  S     ■.21600l:yV5760Ey(x^5)-i^060^y(x~5)^  ^  1556>^y(x~5  )^ 

3L23552  ~~" 

c   «    -f  16560iry  ~  7160Fy(x^5)    -  5606$ry(x-5)^4-950Xy(x>'5)^l    .(17) 

123552 

1  5     ~2880iry-|-1644Zy(x-.5)-^600i:y(x-5)^  ^  18rry(x~5)^ 

12355g 

©   =    ^144£y  -  100Ey(x~5)    -  5o£"y(x>5)S  -f- icgry(x^5)^ 
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Til©  saving  of  labor  effected  "by  our  procedure  will  of  course 
grow  rapidly  with  the  degree  of  the  equation  desired. 
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